A Comparative Study of Various Distance Measures for Software fault prediction

نویسنده

  • Deepinder Kaur
چکیده

Different distance measures have been used for efficiently predicting software faults at early stages of software development. One stereotyped approach for software fault prediction due to its computational efficiency is K-means clustering, which partitions the dataset into K number of clusters using any distance measure. Distance measures by using some metrics are used to extract similar data objects which help in developing efficient algorithms for clustering and classification. In this paper, we study K-means clustering with three different distance measures Euclidean, Sorensen and Canberra by using datasets that have been collected from NASA MDP (metrics data program) .Results are displayed with the help of ROC curve. The experimental results shows that K-means clustering with Sorensen distance is better than Euclidean distance and Canberra distance. Keywords— Distance measures; K-means clustering; Fault prediction; Euclidean distance; Sorensen distance; Canberra distance.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

An Empirical Investigation of Predicting Fault Count, Fix Cost and Effort Using Software Metrics

Software fault prediction is important in software engineering field. Fault prediction helps engineers manage their efforts by identifying the most complex parts of the software where errors concentrate. Researchers usually study the faultproneness in modules because most modules have zero faults, and a minority have the most faults in a system. In this study, we present methods and models for ...

متن کامل

Evaluation of land use change, modeling and prediction of areas susceptible to physical development of the city (Case Study: Nurabad Mamasani Town)

Natural parameters are one of the main determinants of the physical development of cities and settlements. In a mountainous area, the effects of these factors have become a barrier to development and can have natural hazards. In this research, it is tried to identify the optimal directions of physical development of the city of Nurabad as a relatively high region by identifying its effective fa...

متن کامل

Evaluation of Classifiers in Software Fault-Proneness Prediction

Reliability of software counts on its fault-prone modules. This means that the less software consists of fault-prone units the more we may trust it. Therefore, if we are able to predict the number of fault-prone modules of software, it will be possible to judge the software reliability. In predicting software fault-prone modules, one of the contributing features is software metric by which one ...

متن کامل

A GIS-based comparative study of the analytic hierarchy process, bivariate statistics and frequency ratio methods for landslide susceptibility mapping in part of the Tehran metropolis, Iran

The high hillsides of the Tehran metropolis are prone to landslides due to the climatic conditions and the geological, geomorphologicalcharacteristics of the region. Therefore, it is vitally important that a landslide susceptibility map of the region be prepared. For thispurpose, thematic layers including landslide inventory, lithology, slope, aspect, curvature, distance to stream, distance to ...

متن کامل

An Efficient Software Fault Prediction Model using Cluster based Classification

Predicting fault -prone software components is an economically important activity due to limited budget allocation for software testing. In recent years data mining techniques are used to predict the software faults .In this research, we present a cluster based fault prediction classifiers which increases the probability of detection. The expectation from a predictor is to get very high probabi...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1411.7474  شماره 

صفحات  -

تاریخ انتشار 2014